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Abstract — Renewable energy sources can be attached to sensor 
nodes to provide energy replenishment for extending the battery 
life and prolonging the overall lifetime of sensor networks. For 
networks with replenishment, conservative energy expenditure 
may lead to missed recharging opportunities due to battery 
capacity limitations, while aggressive usage of energy may result 
in reduced coverage or connectivity for certain time periods. 
Thus, new power allocation schemes need to be designed to 
balance these seemingly contradictory goals, in order to maximize 
sensor network performance. In this paper, we study the problem 
of how to jointly control the data queue and battery buffer to 
maximize the long-term average sensing rate of a wireless sensor 
network with replenishment under certain QoS constraints for 
the data and battery queues. We propose a unified algorithm 
structure and analyze the performance of the algorithm for all 
combinations of finite and infinite data and battery buffer sizes. 
We also provide a distributed version of the algorithm with 
provably efficient performance. 1 

I. Introduction 

Wireless sensor networks have been widely used in mon- 
itoring [1], maintenance [2], and environmental sensing [3]. 
The lack of easy access to a continuous power source and 
the limited lifetime of batteries have hindered the wide-scale 
deployment of such networks. However, new and exciting 
developments in the area of renewable energy [4] [5] provide 
an alternative to a limited power source, and may help alleviate 
some of the deployment challenges. These renewable sources 
of energy could be attached to the nodes and would typically 
provide energy replenishment at a slow rate (compared to the 
rate at which energy is consumed by a continuous stream of 
packet transmissions) that could be variable and dependent on 
the surroundings. 

Energy management in networks equipped with renewable 
sources is substantially and qualitatively different from energy 
management in traditional networks. For example, conserva- 
tive energy expenditure could lead to (i) very long delays 
because the energy is not being fully used to transmit at high 
enough data rates, and (ii) missed recharging opportunities 
because the battery buffer is full. On the other hand, an over 
aggressive use of energy may lead to lack of coverage or 
connectivity for certain time periods. Further, if the battery of a 
node discharges completely, it could be temporarily incapable 
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of transferring time-sensitive data. This may have undesirable 
consequences for many applications. Thus, new techniques and 
protocols must be developed for networks with replenishment 
to balance these seemingly contradictory goals. 

In this paper, we first consider a single link communication, 
in which the transmitting node has a data buffer that holds 
the incoming variable-rate sensing data and a battery buffer, 
which is being replenished at a variable rate. In our model, we 
allow any combination of finite and infinite data and battery 
buffer sizes. If the data buffer size is infinite, the concern is 
the stability of the data queue, while for finite data buffer, 
excessive data losses is the undesirable event. Likewise for 
the battery buffer, frequent occurrences of battery discharge 
should be avoided. We investigate the problem of maximizing 
the long-term average sensing rate, subject to the constraints 
on the stability of the data queue (or the desired data loss ratio 
when the data buffer size is finite) and the desired rate of visits 
to zero battery state. We provide a simple and unified joint rate 
control and power management framework, and show that the 
performance of our scheme is close to optimum. 

We then extend our algorithm to a multihop network setting 
with multiple source destination flows. We generalize our 
joint power and rate rate control algorithms along with a 
multihop routing scheme. We assign each link a weight that 
is a function of the data queue size, the battery state, and the 
chosen power level for that link. Then, we schedule a subset 
of links at a given point in time, depending on their weights 
using maximum weight matching or maximal matching, under 
the primary (or node exclusive) interference model, that has 
been shown to be a good model for Bluetooth of FH-CDMA 
systems with perfect orthogonal spreading codes and low 
power-spectrum density and used widely in wireless network 
control [6] [7]. We extend the performance analysis and show 
that our algorithm achieves a close-to-optimal performance in 
the multihop setting as well. 

While the problem of energy management in sensor net- 
works has seen considerable attention, there have been few 
works [8], [9], [10], [11], [12], [13], [14], [15] that also include 
energy replenishment. In [8], the authors consider the problem 
of dynamic node activation in rechargeable sensor networks. 
They provide a distributed threshold policy that achieves a 
performance within a certain factor of the optimal solution 
for a set of sensors whose coverage area overlap completely. 
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In [9], the authors study the problem of computing the lexico- 
graphically maximum data collection rate for each node such 
that no node will be out of energy. In [10], the authors consider 
the problem of energy-aware routing with distributed energy 
replenishment. They provide an algorithm that achieves a log- 
arithmic competitive ratio and is asymptotically optimal with 
respect to the number of nodes in the network. Prior work on 
power allocation for wireless networks without replenishment 
has been widely studied, e.g., [16], [17]. In [17], the authors 
assume that the data buffer is large enough so that packet loss 
does not occur and provide a dynamic programming based 
solution. In [16], the authors develop approximate algorithms 
to minimize the average allocated power, or maximize the 
throughout given average power constraint, and at the same 
time keep the data queue stable. The approximation improves 
at the cost of increasing the data queue length and queueing 
delay. Most of these works assume a constant energy supply, 
i.e., there is no battery issue. The idea of [14] and this paper 
are motivated by [16]. We both model energy replenishment 
and consider jointly managing the data and battery buffers. 
This coupling between the data and battery buffers is what 
makes the problem notoriously difficult to solve using standard 
optimization based approaches. For example, unlike [16] that 
utilizes the fact that a static allocation policy is optimal, it is 
not even clear whether a static policy would even be optimal 
in our setting. Specifically, with a battery buffer, there is an 
additional energy constraint that the allocated energy should 
be within the battery state, and this constraint is even more 
difficult than the average power constraint. In [14], the authors 
consider infinite data buffer and finite battery buffer sizes. 
They assume that the replenishing process is i.i.d (for non 
i.i.d process, they claim that a K slot analysis can be applied, 
which is complex and depends on the network size), and show 
that the probability of battery state being less than the peak 
power or close to the full battery state vanishes as the battery 
size grows, under an index policy. In this paper, we construct a 
framework that accommodates all combinations of finite and 
infinite data and battery buffer sizes by defining minimum 
number of virtual queues in a general format. In addition to 
the constraint on the stability of the data queue (constraint on 
the data loss ratio when the data buffer size is finite), we also 
have a constraint on the frequency of battery discharge. Rather 
than assuming an i.i.d replenishing process as in [14], we 
allow for a general replenishment process without assuming 
ergodicity and carefully design the virtual queues and explore 
the relations between actual queues and virtual queues to show 
that our algorithm is efficient. Further, we design a distributed 
algorithm in detail. 

The main intellectual contributions and challenges of the 
paper are as follows: 



Nonetheless, we are able to develop a simple and unified 
framework for all combinations of finite and infinite data 
and battery buffer sizes, and our algorithm is provably 
efficient. 

We extend the algorithm from a single-link case to 
a multi-hop network under node-exclusive interference 
model with practical settings and develop an efficient 
distributed algorithm. 



II. Single Link Model 



We first consider a single link control model in this paper, 
as illustrated in Figure 1. We assume a time slotted system 
and in time slot t, the amount of data available for the sensor 
node to sense is denoted by A(t), which is upper bounded by 
^4max, (0 < A max < oo). In the same time slot, the amount of 
data the node actually senses and places in the data buffer is 
R(t) . The amount of energy expended by the node at time t for 
data transmission is P(t), which is upper bounded by P pea k, 
(0 < -Ppeak < oo) and the achievable data rate at that power 
level is fi(P(t)), where we assume ^i(-) to be monotonically 
increasing, reversible and differentiable on the half real line 
3? + 1J{0}. The node has a battery of size Bi, (either < 
oo or Bb — oo) with zero initial battery state. We let r(t) 
denote the amount of replenishment energy arriving to the 
node at time t. The energy state of the battery and the data 
state of the data buffer at time t is given by qb(t) and qd(t), 
respectively. The data buffer has size Bd (again, either Bd < 
oo or Bd = oo). Under this setting, we describe the objective 
in the following section. 




' data queue 
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Fig. 1. Single Link Control Model 



We formulate a sensing rate maximization problem with 
QoS constraints on both the data and battery queues. Due 
to the coupling between the battery and data buffers, a 
stationary policy could be suboptimal, hence traditional 
resource allocation techniques do not directly apply. 



A. Problem Formulation 

Our general objective is to maximize the long-term average 
sensing rate, subject to the QoS constraints on both data and 
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battery queues: 

(A) 

s.t. q d (t + 1) = min 



1 T_1 

maxliminf — > R(t) 
P.R T-^oo T t-*' w 
t=o 



M (P(i))) + + R(t),B d 



q b {t + l)= min - P(t) + r(t),B b 

0 < R(t) < A(t), 
0 < P(t) < min [g 6 (i),P P eak 

Po < Vo, 



1 T_1 

limsup - V < oo (B d 
Pd < % (-Bd < oo), 



oo), 



or 



(1) 
(2) 
(3) 
(4) 
(5) 



(6) 



where (•)+ = max[-, 0], R = {R{0), R(l), . . . , R{T- 
is the actual sensing data vector, P = {P(0), P(l), . . 
1), . . .} is the allocated power vector, and 



1),...} 
,P(T- 



Pd 



0, if lim inf r^oo ± YlJ=o R(t) = 0 
limsup T _ 7^=0 D(t) 



otherwise 



T-l 



p 0 = limsup^ 7 0 (i) 



(7) 



(8) 



t=o 



are the long-term data loss ratio with an upper bound r\ d , 
and the frequency of visits to zero battery state with given 
threshold r\ Q respectively, where 



D(t) = (jq d (t) - »(P(t))) + + R(t) - B d ^j + 

-^o(^) — 1 battery hits zero state in slot t ~ 



(9) 



if P(t) < q b (t) 
otherwise 



(10) 



are the amount of data loss in slot and the indicator that the 
battery discharges completely in time slot t, respectively. Note 
that, we do not assume ergodicity of the system parameters, 
but if they are ergodic, then p Q represents the actual probability 
of a complete discharge event as t — > oo. 

In Problem (A), Constraints (1) and (2) describe how the 
data and battery queues evolve, respectively. Especially, if 
B d = oo, (1) can be simplified to 

q d (t+l)= ( gd (t)-/i(P(f))) + + P(t), (I') 

and if B b = oo, (2) can be simplified to 

q b (t+l) = q b (t)-P(t) + r(t). (2') 

Constraint (3) bounds the actual amount of sensed data R(t) 
by the amount of available data A(t) in slot t. Constraint 
(4) states that we cannot oversubscribe the energy that is 
unavailable in the battery nor can we exceed the peak power 
level. Constraint (5) is the battery QoS constraint r\ 0 of 
the desired battery discharge rate. Constraint (6) is the QoS 



constraint for data queue: if B d = oo, we need to keep the data 
queue stable, and if B d < oo, the data loss ratio is required 
under a given threshold rj d . 

In this paper, our purpose is to develop a simple algorithm, 
which performs arbitrarily close to the optimal performance. 
To achieve that purpose, we do the following: 

• define virtual queues for both the data and battery buffer 
to avoid the difficulties involved in dealing with the data 
loss and battery discharge probability directly. We show 
that keeping the virtual queues stable ensures that the 
constraints on p d and p a are met. 

• design a power allocation scheme based on simple index 
policies and show that our scheme keeps both virtual 
queues stable and at the same time performs arbitrarily 
close to the optimal performance. 

• generalize our algorithm to the multihop scenario and 
develop distributed algorithms. 

B. Virtual Queues 

We define q d and q b as the virtual data and battery queues. 
The virtual queues evolve according to the following Lindley's 
queue evolution equations: 

Qd(t + 1) = ((&(*) - VdR(t)) + - n(P(t)) + R(t) + 7(i)) + , 
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(11) 

(t + 1) = ( (q b (t) -Vo) + + P(t) - r(t) + M(t) + 7 0 (t)) + , 

(12) 

where I(t) — (fi(P(t)) — q d (t)) + is the amount of transmitted 
idle packets when there is no enough data to transmit using 
the allocated energy, M(t) = (q b (t) - P(t) + r(t) - B b )+ is 
the amount of missed replenishing energy due to full battery 
when B b < oo, and l 0 {t) is defined in Equation (10). Note 
that if B b = oo, then M(t) = 0 and Equation (12) reduces to 

«6(* + 1) = (fat) - Vo) + + P(t) -r(t) + i„(t)) + . (13) 




Fig. 2. Battery Queue and Virtual Battery Queue 

Figure 2 shows the relationship between the actual and 
virtual battery queues when the battery size is finite. The 
amount of change in q b (t) from time t to t+ 1 is P(t) — r(i) + 
M(t). Since the battery has a finite size, this term vanishes 
when averaged over an infinitely long period of time. Then, 
Po = limsupj^^ ^ J2t=o lo{t) and r/ 0 can be viewed as the 
long-term input and output rate of q b (t), respectively. Thus, it 
is reasonable to expect that q b (t) being stable implies p Q < r\ 0 . 

Without loss of generality, the initial state q b (0) and q d (0) 
can be set to be zero. The following proposition shows that 
if the virtual queues q d (t), q b (t) and the actual battery queue 
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qb(t) are all strongly stable, p d and p a are guaranteed to meet 
their constraints. 

Proposition 1: If the virtual queues q d {t), q b (t) and the 
actual battery queue q b (t) are both strongly stable, i.e., 



1 T_1 

limsup - V (q d (t) + q b (t) + q b (t)) < 



oo, 



t=0 



then p d < rjd and p a < r\ 0 . 

The proof of this result can be found in Appendix A. Next, 
we present our scheme. 

III. Joint Rate Control and Power Allocation 
Algorithm 

A. Algorithm 

The algorithm consists of two components: a rate control 
component and a power allocation component. Both compo- 
nents are index policies, i.e., the solutions are memoryless 
and they depend on the instantaneous values of the system 
variables. 

Rate Control (RC): 
We define 0 < V < oo to be the control parameter of our 
algorithm. Let Qd(t) — qd(t) when Bd = oo, and let Qd(t) = 
(1 — rid)qd(t) when Bd < oo. If Qd(t) < \, the transmitting 
node chooses to sense all the available data, i.e., R(t) = A(t). 
Otherwise, R(t) = 0. 

Power Allocation (PA): 
Solve the following optimization problem 



(14) 



where U(t) = {P(t) : 0 < P(t) < mm[q b (t), P pe - dk }} is a 
compact and nonempty set. Allocate P(t). 

The set, LT(t) of possible power allocations guarantees 
Constraint (d) on P(t) in Problem (A). If fi(-) is concave 2 , the 
objective function is a concave function of P(t). Consequently, 
PA solves a simple convex optimization problem in each time 
slot. The positive term Qd(t)p(P(t)) can be viewed as the 
utility of the allocated power P(t) and the term q b (t)P(t) can 
be viewed as its associated cost. When the control parameter 
V is chosen to be large, Qd(t) tends to be large according to 
RC, and PA tries to allocate higher power P(t) to increase the 
utility, whereas, when the virtual battery queue length q b (t) is 
large, PA avoids allocating a high amount of power to reduce 
the cost. Thus, this index policy of PA can be viewed as a 
greedy profit maximization scheme. 

Observation 1: From the objective function in Equation (14), 
Qd{t)rL^P{t)) can be viewed as the utility. When Qd(t) is 
large, i.e., the data queue or the virtual data queue is large, 
PA tends to allocate a larger power P(t), so that Qd(t) will 
decrease. 

Observation 2: Once the battery is discharged, i.e., q b (t) = 0, 
q b (t) tends to increase as can be seen in Equation (12) or (13). 

2 For instance, consider the additive white Gaussian noise channel 
capacity, fJ.(P) = log(l + P/No), where iVrj is the two sided noise power 
spectral density. 



Consequently, PA allocates a smaller power P(t), so that the 
discharge event happens infrequently. 

Observation 3: As q b (t) increases, q b (t) decreases, hence, PA 
tends to allocate a larger power P(t) to avoid battery overflow 
in advance. 

From Observation 2 and Observation 3, PA expends energy 
neither too conservatively nor too aggressively. 

B. Performance Analysis 

Recall that A(t) is the available amount of sensing data and 
R(t) is the actual amount of data the transmitting node chooses 
to sense. Clearly, using the rate controller, we make sure that 
the data queue remains within a certain bound. This has a 
positive effect on the battery as well, since a certain portion 
of the data packets are not allowed into the transmitting node. 
The natural question one would ask here is, whether our rate 
controller rejects too many packets in the first place to synthet- 
ically meet the constraints. In the following theorem, we show 
that this is not the case. Let A* = liminfV^oo ^ 2^2t=o R*(t) 
be the optimal objective value of Problem (A) for any given 
exogenous process {A(t), t > 0} and the replenishment 
process {r(t), t > 0}. In the following theorem, we show 
that the sensing rate achieved by RC and PA is asymptotically 
close to the optimal sensing rate A* as Bd and B b tends to 
infinity. The optimal value can be achieved arbitrarily closely 
by increasing T^.We use the notation y = 0(x) to represent y 
going to 0 as a; goes to 0. 

Theorem 1: If the following conditions hold: 

(1) /u(-) is concave on 3? + U{0}> an d i ts slope at 0 satisfies 2 
0 < P = m'(0) < oo, 

(2) 0 < r(t) < r max , for all t > 0, 

then the joint power allocation and admission control algo- 
rithm (with RC and PA) achieves: 



Qd(t) < 2 +A » 



9b(t) < /3 

Po < Vo, 
Pd < Vd, 



V 



Vt>0 (15) 

Vt>0 (16) 

(17) 
(18) 



1 



t=0 t=0 



where fi m . dx 
rate, and 



g(V,B d ,B b ) 



-Vo{l^ + P)-9{V,B d ,B b ), (19) 
M(-Ppeak) is the upper bound for the transmission 



0{ 
0( 



v 

-B d )+, 



(§V V -B b ) + 



if Bd < oo, B b < oo 
if Bd = oo, B b < oo 
if Bd < oo, B b = oo 



) + o( 



if B d < oo, B b < oo. 



The proof of Theorem 1 can be found in Appendix B. In 
Theorem 1, V is a finite tunable approximation parameter 
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that controls the efficiency of the algorithm. Observe Equa- 
tion (19), which compares the performance of our algorithm 
with that of the optimal solution of Problem (A), the term 
^o(Mmax + (3) captures the influence of battery outage, and it 
is small since the battery outage threshold r\ 0 is usually set 
to be very small to avoid network disconnection. Function 
g(V, Bd, Bt) represents the asymptotical property of the gap. 
If both battery and the data buffers are of infinite size, 
then the performance gap is identical to 0. Otherwise the 
performance gap is dictated by the finite (or the smaller) one of 
the buffers. One can observe that, by appropriately choosing 
the ^-parameter, we can make the performance gap decay 
inversely proportional to the buffer sizes. 

IV. Multihop Network Model 





Fig. 3. Multihop Network Model 

We consider a multihop wireless sensor network with N 
nodes and L links as illustrated in Figure 3. Each node 
n e N = {1,2, ...,N} is attached to power sources for 
replenishment. Let A^(t) and R^it) denote the amount of 
available data for sensing and the actual amount of sensed 
data, to node n that are destined to node e in slot t. We 
assume that each node n maintains an infinite data buffer 
with state q^ ,e {t) for flows destined to e, and also maintains 
a finite battery buffer with size B£ and state q%(t) (We focus 
on infinite data buffer and finite battery buffer in order to 
emphasize the difference from single link to multihop. For 
other combinations of data and battery buffer sizes, then 
extension can be made similarly without too much effort). 
Let r n (t) denote the replenishment at node n in time slot 
t. The transmit power is chosen to be Piif) over link In 
the formulation, we assume that the power the receiving node 
consumes to receive and decode the packet is identical to 
Pi(t) as well. The sole reason for this is simplicity and the 
generalization to the asymmetric case is straightforward (By 
defining a receiving transmitting power ratio, we can extend 
it to the general case with no technique difference). We use 
the node-exclusive interference model. Under this model, a 
node can only receive from or transmit to at most one node 
at any time slot. In each time slot t, with the assigned power 
Pi(t), the achieved data rate at link I is ni(Pi(t)) in that time 
slot, where the rate function fii(-) is a non-decreasing, concave 
and differentiable function satisfying /x;(0) = 0. Let p™ be 
the frequency of visits to the zero battery state for node n. 
Let fi„ and 6„ denote the set of directed links originated 



from node n and terminate at node n, respectively. We say 
P = [Pi(t) ■ ■ ■ PL{t)] satisfies the node-exclusive model if 
Pi(t) > 0 for some i e !1„U 6„, then P v = 0 for all 
I' G (0„U9„)\{i}. In a multihop network, we formulate the 
joint queue and energy management problem as follows: 



T-l 



(B) majliminfiE E Kit) 

P,R T^oo T ^ ^ 

s.t. P(t) satisfies the node-exclusive model, (20) 

# e (* + i) < (C e (t) E ri(W))) + + K(t) 

ien n 

+ E>?w))' n ^ e > 

iee„ 



(21) 



g"(t+l) =min 



</?(*)- E W)+r n {t),B£ 
zefi n ue„ 



Qb (*)> -Ppeak 



o < E ^ n 

iefi„ue„ 

N 

5>f(fl(t)) =h(W)), Kit) < A- n (t), 



e=l 



T-l 



limsup I- E <ld' e it) < °°: n + e, 



t=0 



(22) 
(23) 

(24) 

(25) 
(26) 



where r/™ is the desired upper bound for p™, P(t) is the 
power assignment vector for all links in slot t, P is the power 
assignment for all links over all time slots, and R is the actual 
sensing data vector for all node-destination pairs over all time 
slots. In Problem (B), the objective is to maximize the long- 
term average total sensing rate for all nodes destined to all 
destinations. 

In Problem (B), (20) is the interference constraint. Con- 
straints (21) and (22) describe how the data and battery queues 
evolve, respectively. Note that the destination node of each 
flow does not need to maintain a data buffer for that flow, as 
indicated in (21). Constraints (23) are the energy conservation 
equations stating that we cannot oversubscribe the energy that 
is unavailable in the battery nor can we exceed the peak power 
level. Constraints (24) are the rate conservation equations that 
bound the actual amount of sensed data R„it) by the available 
amount of data A^(t), and share the transmission rate of a link 
among all the destinations in slot t. Constraint (25) is the QoS 
constraint for data queue: we need to keep all the data queues 
stable. Constraint (26) is the battery QoS constraint of the 
desired battery discharge rate 77". 

Similarly, we define virtual queues for all n e N 
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(i+i) = ((cw-<) + + E m-r n it) 

ien n u@ n 

+ M n (t) + i:(t)Y ", (27) 
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where M n (t) = - Ei e n n ue n W + - + 

is the amount of missed replenishment and 

7™(i) = indicator that battery hits zero state in slot t 
for node n 

= | 0 if E« 6 n BU e B ^(*) <«"(*) 



1 otherwise 



Next, we generalize the main results and algorithms RC and 
PA to the multihop scenario. 

Corollary 1: If all the virtual battery queues q% (t), VneN 
are strongly stable, we have p™ < 77™, Vn e IN". 
The proof is identical to the single hop scenario and can be 
found in Appendix C. We give the algorithm in the following 
section. 

V. Joint Rate Control, Power Allocation and 
Routing Algorithm for Multihop Networks 

The joint rate control, power allocation and routing algo- 
rithm for multihop networks can either be implemented in a 
centralized or distributed manner. For the centralized solution, 
we use the classical Maximal Weighted Matching (MWM) 
based algorithm and for the distributed algorithm, we can use 
the Maximal Matching (MM) based algorithms as in [18] [19]. 

Multihop Rate Control (MRC): 
Depending on whether Maximum Weight Matching or Maxi- 
mal Matching is employed by the scheduler, there is a slight 
difference in the implementation of MRC: 
Maximum Weighted Matching (MWM): If q^' e (t) < \ , node 
n chooses to sense all the available data packets, i.e., R„(t) = 
A^(t); otherwise, reject all the arrivals, i.e., Rf^t) = 0. 
Maximal Matching (MM): If q^ e {t) < V , node n chooses 
to sense all the available data packets, i.e., R^it) — ^(t); 
otherwise, reject all the arrivals, i.e., R^{t) = 0. 

Multihop Power Allocation (MPA): 
Here the goal is to ensure that no node transfers data of a 
flow to a relay node that is not the destination of that flow, 
unless the differential backlog for that flow is greater than a 
fixed value 7 > 0. We will choose the value of 7 such that the 
resulting backlog of the receiving node is not larger than that 
of the transmitting node after the transmission. This pushes 
the data flow from the source to the destination with a positive 
back pressure. Let tran(Z) and rec(/) denote the transmitting 
and receiving node of link I, respectively. We first define 



7j 



7 if rec(Z) 7^ e 
0 otherwise 



Let ei(t) = 
be the flow 



where 7 > 0 is some constant. 

f tran(;).e/,-, rec(i),e/,\ ( 

argmax e jq d (t) - q d (t) - 7 f 
on link I that has the maximal modified differential backlog 
and w t {t) = max W™^'^ (t) - q™ {l) > e ' {t) (t) - 7i ei(t) ,0 
is the nonnegative differential backlog of I at time t. 
For each link I, solve 



max 

Pi(*)en, 



iiv \ j 



where U t (t) = |P;(t) : 0 



< 



< 



mm 



tran m 



Let Pf(t) be the solution for link I. With the calculated 
power Pf{t), let W t {t) = Wl (t)^( Pl s (t)) 

^ ec(0 (t))p, s (t) be the weight on link I. 

For the whole network, we either use the MWM or MM 
as described below. We will analyze the performance of our 
solution when using each algorithm. 

Maximum Weighted Matching Algorithm: link I has weight 
Wi(t), then the weight of a matching M is Wja(t) = 
E/gm^K*)- The network chooses a maximum weighted 
matching in a centralized manner, the links in the cho- 
sen matching become active with the calculated transmitting 
power, and other links are not activated. 

Maximal Matching Algorithm: the network calculates a max- 
imal matching that achieves at least half of the total weight 
of MWM in a fully distributed manner as in [18] [19]. The 
links in the chosen matching become active with the calculated 
transmitting power, and other links are not activated. 

Multihop Routing: 

When Wi(t) > 0, transmit for flow that is destined to 
ej(t) with rate w(P/(i)), i.e., ,(t) (fl(t)) = w(P;(t)) and 
Hf(W)) =0, Ve^ e/ (t). 

Note that MRC and routing can be done by each node inde- 
pendently. We then give our main theorem for the multihop 
scenario: 

Theorem 2: If 

(1) is concave on 9? + 1J{0}> an d i ts sl°P e at 0 satisfies 

0 < 0 = mK°) <oo,yie £, 

(2) Vn e DST: 0 < r n {t) < r max , Vi > 0, 

then the maximum weighted matching based joint rate 
control MRC, power allocation MPA, and routing algorithm 
achieves: 



«?(*)< /?(^+^max), 
n ( t=0 e ) 

> ]T jliminf I J2 K(t) vZiHr^ + 0) 



(29) 
(30) 



-0,(1^,-0,1,, 



V 



(31) 



(28) 



and the maximal matching based joint rate control MRC, 



7 



power allocation MPA, and routing algorithm achieves: 

qd' e (t)<V + A m , x , 
«?(*) <P(V + A max ), 



(32) 
(33) 



t=0 e 



>y jiiminf- yV 

n \ t=0 e 



2 



-<(/W + /3) 



(34) 



The proof of Theorem 2 can be found in Appendix D. The 
results can be interpreted similarly as Theorem 1. 



compare the results with the optimal value 3 . From Figure 5 
(a), we see that as V increases, the average total sensing rates 
of the MWM and MM based algorithm keep increasing and get 
closer to the optimum and a value that is much larger than half 
optimum, respectively. This is consistent with Equation (31) 
and Equation (34). From Figure 5 (b), we see that as V 
increases, the average data queue length (we here only plot the 
data queue length of node 3 for flow 1 due to space limitation) 
keeps increasing but is upper bounded by the bound we get 
in Equation (29) and Equation (32). This means the queueing 
delay increases as we improve the sensing rate, which can be 
viewed as a tradeoff. From Figure 5 (c) we observe that the 
battery discharge probability (we only plot for node 5 here) 
increases to the threshold as V increases. 



VI. Numerical Example 

We consider a network topology, shown in Figure 4(a). 
There are 6 nodes, 7 links, and 2 flows with source-destination 
pair (3,1) and (5,2), respectively. In all simulations, the 
simulation time is T = 10 6 time slots. We use the rate power 
function /Lt;(P;) = 101og 2 (l + ^) packets/slot V/ G £. Let 
the power of the background noise JVj = 1.6 x 10 — 14 W", V/ G 
L, and the channel gains gi = 1.6xl0 -13 , V/ G £. Each node 
is equipped with an infinite data buffer for each flow through 
it. The number of arrivals A\{t), t > 0 and Af(i), t > 0, 
are modeled as independent Poisson random variables with 
mean A = 20 packets/slot and A max = 30 packets/slot. We 
set 77™, the threshold of battery outage probability to 0.03 
for all n G N and the peak power P pea k = 1.5 W. The 
backlog threshold 7 = 80 > max„ !e fe re c(i)=n Mi + ^n) = 
2 x 101og 2 (1 + lOPpeak), so that the resulting backlog of the 
receiving node is not longer than that of the transmitting node. 



source 1 




destination 2 



Fig. 4. (a) Network topology and (b) a sample replenishing process 

Scenario 1: We first use a replenishment process which is 
formed by a periodic deterministic sine waveform (r max = 1.2 
and period 8000) plus independent Gaussian noise with zero 
mean and variance 0.01, as shown in Figure 4(b) (The cycles 
imitate the daily solar cycles for a solar battery and the average 
replenishing can be simply calculated f = 0.2). All the battery 
buffer sizes are set to be B b = 100 J. We simulate both MWM 
based and MM based algorithms. We choose different values 
of the control coefficient V for the proposed algorithm and 
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Fig. 5. Performance of the MWM and MM based algorithms. Impact of the 
control parameter V on (a) the average total sensing rate, (b) average data 
queue length, and (c) the battery discharge probability for Scenario I . Impact 
of battery size on (d) the average total sensing rate for Scenario 2. 



Scenario 2: We use different replenishing processes: ra(i) 
and rs(t) are i.i.d Bernoulli random variables Bernoulli(0.5) 
(i.e., ?"2(£) = 1 w.p. 0.5 and T2(t) = 0 w.p. 0.5); replen- 
ishing at all other nodes are independent Bernoulli random 
variables 0.2 x Bernoulli(0.5) in even number slots, and 
0.6 x Bernoulli(0.5) in odd number slots (f2 = ^5 = 0.5 and 
f = 0.2 for other nodes), all plus Gaussian noise with zero 
mean and variance 0.01. This replenishing process is faster 
time-varying than the one in Scenario 1. We simulate 3 three 

3 The exact optimal objective value for Problem (B) is hard to obtain, 
so we here use an upper bound for the optimum. For this example, an upper 
bound for the optimum can be obtained by equal time sharing of schedules 
{1, 4, 7} and {1,5,6}, and utilizing the link rate fi(f ) under infinite battery 
size and no discharge constraint. In Scenario 1, the optimum is 2fj,(f) = 30. 
In Scenario 2, since fi(r2) = fJ.(rs) > A, the optimum is A + fJ.(f-j) = 35. 
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different battery sizes B b = 100J, B b = 10J and B b = IJ 
(all nodes has the same battery sizes) for the MWM based 
algorithm. From Figure. 5 (d), we can see that the sensing 
rate increases as battery size increases. However, as long as 
the battery is large compared to the average replenishing rate, 
the improvement diminishes with increasing battery sizes. 

VII. Conclusion 

In this paper, we studied the problem of energy management 
in rechargeable wireless sensor networks. Our objective was 
to maximize the average data sensing rate subject to QoS 
constraints on both data and battery queues. We provided a 
simple and unified framework of joint rate control and power 
allocation for all combinations of finite and infinite data and 
battery buffer sizes. We showed through both analysis and 
simulation that the performance of our strategy is close to 
that of the optimal solution. We extended our algorithm to 
the multihop scenario and showed that simple extensions of 
our index schemes for single hop generalize to the multihop 
scenario with a similar, close-to-optimal performance. We 
developed a distributed joint rate control, power allocation 
and routing algorithm for multihop networks under node- 
exclusive interference model. Although we do not consider 
channel variations in this paper, taking channel variations 
into consideration is technically straightforward. Our multihop 
network formulation does not consider the fairness among 
links, however, the opportunistic scheduling framework [20] 
used to resolve the fairness issue can be applied in this context. 
For future work, we are extending this framework to the 
general interference model where the rate power function is 
not concave. 
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Appendix 
A. Proof of Proposition 1 

Using the idea similar to [16], we have the fact that 
if any queue represented with Q(t) is strongly stable, 
then limsupy^^ Q^P- = 0. Hence, if qd(t), q b (t) and 
q b (t) are strongly stable, then limsupj^^ = 0, 

limsup^^ = 0 and limsup^^ = 0. 

From Equation (11), we have q d (t + 1) > q d (t) — rj d R(t) + 
D(t) - D(t) - fJ,(P(t)) + R(t) + I(t). Note that q d (t + 1) = 
q d (t) - (J,(P(t)) + I(t) + R(t) - D(t). By adding from 0 to 
T — 1, dividing by T and taking limsupj^^ on both sides, 
we have 



km sup > km 

T— >oo T ~ 



m + lim 

T T^oo T 



Qd(0) 



+ 



T-l 



kmsupi^T \D(t) 



VdR(t) 



gd(0) 
T 



Since limsupy^^ = 0, liiriT-> 

and Hm r ^^M = 0, so 
limsupr^ i T^J 0 [D{t) - < 0. Thus, 



we 



0, 
get 



Pd - Vd 



i.e., p d < Vd- 



■■ km sup 

T^oo 



= km sup 

T^oo 



Vd 



<o, 
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Similarly, from Equation (12), we have q b (t + 1) > q b (t) — 
r lo + P(t)-r(t)+M(t) + l 0 (t). Note that q b (t + 1 ) = q b (t ) - 
P(t) + r(t) - M(t). By adding from 0 to T — 1, dividing by 
T and taking limsupy^^ on both sides, we have 



lim sup — — 



1 T_1 

" T^kx) ~ Vo + H ™ SU P T 51 7 °(*) 



«fc(0) 



T^oo 



4=0 



+ lim 

T^oo 



eft(0) - <r,(T) 



0, limT^^M 



Since limsupy^^ 

and limr^oo ^(°)-9»t T ) = o, so we get Po = 
limsupy^ ^ Y,l=o ] o(t) < Vo- ■ 

B. Proof of Theorem 1 

Proof of Equation (15): Note that I(t) < /z(P(t)) and 
R(t) < A m . dx . The rate allocation unit RC is chosen to satisfy 
Equation (15). 

Proof of Equation (16): Since p(-) is concave on 3? + 1J{0}, 
we have /x(P(i)) < /z(0) + /?P(t) for P(t) e II(i), Vt > 0, 
where 0 < /3 = ^'(0) < oo. Then, Q d {t)^{P{t)) - 
q b (t)P(t) < Q d {t)n(ti) + (3Q d {t)P{t)-q b {t)P{t) where P{t) 
is the solution of PA. 

lf(3Q d (t)P(t)-q b (t)P(t) < 0, then we get Q d (t) M (P(t))- 
q b (t)P(t) < Q d (t)fx(0). However, PA chooses P(t) that maxi- 
mizes Q d (t)fi(P(t))-q b (t)P(t) which means Q d (t)fj,(P(t))- 
q b (t)P(t) > Q d (t)/i(0) since 0 e II(t). Then we must have 
PQd{t)P{t) - q b (t)P(t) > 0, i.e., pQ d {t)P(t) > q b (t)P(t). 

We now prove Equation (16) by induction. Without loss of 
generality, let q b (0) < /3(y + A m . dx ). Suppose for all t > 1, 
<7h(i - 1) < + Anax) holds. In slot t, if P(t) > 0, then 
Qblt) < 0Q d (t) < P{\ + A max ). Otherwise, P(t) = 0, from 
Equation (10) and the assumption r(t) > 0, Vt > 0, we have 
7 G (t) = 0. We also note that M(t) < r(t), then q b (t) < 
q b {t - 1) < /3(| + A max ). Therefore, < $(\ + A miix ) 
for all t, which is Equation (16). 

Proof of Equation (17) and Equation (18): If p a = 0, we 

are done. In the following analysis, we assume p D > 0. 

From Equation (16), we already have limsupj^^ ^P- = 
0. In order to apply Proposition 1, we only need to show 
limsupy^ ^p- = 0. 

Claim: For any t > 0, there exists t'(t) > t such that 
q b (t' + 1) = r(t') and t' — t < oo. Suppose there exists 
t > 0 such that for any t' > t with q b (t' + 1) = r(t'), 
t' — t = oo. This means that < oo such that q b (t + 1) > 
r(f), Vt > TV, i.e., 7 0 (t) = 0, Vt > TV. Thus, Po = 

iimSUPT^oo y Ef^O 1 = limsu PT^oo ^ T,t=o 7 °W = 

0, which leads to a contradiction. 

Since r(t) < r max < oo, Vt > 0, and q b (t) - q b (t + 1) < 
P(i) < Ppeak- Combining with the above claim, we have 
q b {t) < r max + (t'(t) - t)P peak < oo, Vt > 0. Therefore, 
limsupy^ ^ = 0. 

Equation (17) then follows from Proposition 1. 

Equation (18) directly follows from Equation (15) Proposi- 
tion 1. 



Proof of Equation (19): 

I) B d = oo and B b = oo. 

We define the Lyapunov function L(q d (t),q b (t)) = q d (t) + 
ql(t), and A(q d {t), q b (t)) - L(q d {t + l),q b (t + 1)) - 
L(q d (t), q b (t)), where q d (t) and q b (t) are the resulting queue 
dynamics of our algorithm given a sample path. Note that 
our problem is a sample path optimization problem, so 
L(q d (t),qb(tj) is defined without conditional expectation. 

From Equation (13), we have q b (t + 1) < (q b (t) — r/ 0 ) 2 + 
(7 D (t) + P(t) - r(t)) 2 + 2{q b {t) - r? 0 )+(7 0 (t) + P(t) - r{t)). 
Also from the data queue dynamics, we have q^[t + 1) < 
«d(*) + M 2 (P(^)) + R\t) + 2%(t)P(t) - 2q d (t)»(P(t)), then 

A=A(%(t),g 6 (t)) 

< M 2 (P(t)) + P 2 (t) + (W + 77 0 ) 2 - 2q d {t)p{P{t)) + 
(1 + Ppeak) 2 + 2q d (t)R(t) + 2q b (t) (7 0 (t) + P(t) - r(t)) 

<(1 + Ppeak) 2 + (rmax + »?o) 2 + Mmax + + VR(t)+ 

2q b (t)(l 0 (t)-r(t))+2[q d (t)-V/2]R(t)- 
2[q d (t)p(P(t))-q b (t)P(t)}. 

It is apparent that RC is trying to minimize the term 
[q d (t) — V/2]R(t), and PA is trying to maximize the value 
of the term [q d (t)fi(P(t)) - q b (t)P(t)]. Since the optimal 
solution for Problem (A) may not be unique, we let V* be the 
optimal solution set and P* e be any optimal solution, for 
Problem (A) given any sample path. Since the constraint set 
II(t) is queue dynamic related, it is possible that P*(t) ^ n(t). 

Lemma 1: If by solving Equation (14), we get 
[Q d (t)v(P(t)) - q b (t)P(t)} < [Q d (t)»(P*(t)) - 
q b (t)P*(t)J, then P(t) = q b (t) and 7 0 (t) = 1. 
Proof: In time slot t, let P m (t) be the value that maximizes the 
unconstrained objective function Q d (t)p(P{i)) — q b (t)P(t). 
Claim 1: q b (t) < P pea k- Otherwise, II(t) = [0,P peak ] 
which is not queue dynamic related, then Q d (t)p(P(t)) — 
q b (t)P(t)} > [Q d (t)p{P*(t)) - q b (t)P*(t) . 
Claim 2: P m (t),P*(t) > q b (t). If P m (t),P*(t) £ U(t), we 
must have [Q d {t)fi(P{t)) - q b {t)P{t)] > [Q d (t)^(P* (t)) - 
q b (t)P*(t)], then P*(t) > q b (t). If P m (t) < 0, we 
will have Q d (t)M0) - [Q d (t)p(P(t)) - q b {t)P{t)] > 
[Q d (t)n(P*(t)) - q b (t)P*(t)], then P m (t) > q b (t). 

By the above claims, P(t) < P*(t) and P(t) < P m (t). 
Suppose P(t) < q b (t), since the objective function of PA is 
concave in P(t) and P m (t) > <7t(t), we can increase P(t) to 
increase the objective without violating the constraint. Thus, 
P(t) = q b (t) and 7 0 (t) = 1. ■ 

By Lemma 1, we have 



max + ^max + VR(t)- 



A <(1 + P peak ) 2 + (r max + 7]o) 2 + P 

VR*{t) + (V + 2A m . iX )(p + p m . dX )l Q (t)+ 

2q d (t) [R*(t) - p(P*(t))] + 2q b (t) [P*(t) - r(t)] . 

(35) 

The following lemma gives the sufficient condition for 
strong stability under sample mean argument. 

Lemma 2: For any queue with evolution q(t+ 1) = (q(t) — 
Ai(t)) + + A(t), if g(0) < oo, n{t) < ,u max , Vt > 0 and 
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limsup-j,^ ^ Et=o ( A W ~M*)) < °> then is strongly 
stable, i.e., limsupy^^ ^ Et=o < 00 ■ 
Proof: Without loss of generality, assume q(0) = 0. Let 
T 0 = 0, Ti = min ji > 0 : g(t) < pt(t)|, T 2 = min |i > 

T\ : < T fc = min {i > T fe _! : q(t) < 

},..., and so on. Let N k = T k - T k -i, Vfc > 1. 
We first claim N k < oo, Vfc > 1. Suppose there ex- 
its 1 < k < oo such that N k = oo, then it means 
Er=T fe _! ( A (*) - + !)) > 0, VT > T fe _i, and as a result, 
liminfT^oo ^ Et=T fc _ 

x (A(i) - fj,(t + 1)) > 0. However, 

T-l T fe _i-1 

= E( a (*)-m*))- E (aW-mW) 

i=0 t=0 

+ M(r fc _i) - Mr) 

/ * In each of the fc segaments, the sum of the arrival 
departure difference becomes negative once, and the 
negative drift is no smaller than — yU max * / 

T-l 

< E ( A W - M*)) + fc A*max + /i(Tfc_i) - /i(T), 



t=0 



then, 



T-l 



T-l 



E (A(t)-M* + i))<E( A (*)-M*)) 



t=T fc 



t=0 



+ fcMmax + M(7fc-l) -/i(T). 

By dividing by T and taking limsupy^^ of both sides, 
we have 

T-l 

liminf- E (A(t)-M* + 1)) 

*=T fe _i 
1 T_1 

< lim sup- E (*(*)- + 

c— J fc-1 

1 T_1 

<Iimsup-E(M*)-M*)) + 



T^oo 



T 

t=u 



lim sup 

<o, 

which is a contradiction. 

Let N = maxfc>i N k , then N < oo. This im- 
plies q(t) < N[i m . dX} Mt > 0. Now we suppose 
limsupy^^ ^ J2t=o = 00 • This means there exists 

a subsequence of times {£„} such that t n — > oo and 

t^Et=o 1< ?(*) °°- In other words ' VQ > 0, 3 T(Q) e 
{i n } such that Vtfe e {i n } and t k > T(Q), we have 
T~ Et=o > <3- Let Q — Nfi max , and find the corre- 

sponding T(Nfi m , x ) such that T{N \ ma) Efi^"" 5 " 1 <?(*) > 



-A^Mmax- Then there must exists 0 < T < T(iV^ max ) with 
<z(T) > N[i m . dx which leads to a contradiction. Therefore, 
limsupy^ L Ef^o* < 00 • ■ 
Lemma 3: 



T-l 



i limsup i E Ut) [P*(t) r(t)] < O(i). 



T^oo 



t=0 



Proof: Note that 

T-l 



T-l 



t=0 



t=0 



<0, 



then 



T-l 



E(P*(*)-r(t)- C ) <g b (0), 



t=o 



where e > 0 can be arbitrarily small. By divided by T and 
taking lim supj^^ of both sides, we have 

1 T ~ 1 

limsup -E (P*(t)-r(t)-e) < 0. 
Construct an auxiliary queue with the following evolution: 

%(t+l)= (q* b (t)-r(t)-e) + + P*(t), 

where e > 0 can be arbitrarily small. By Lemma 2, ql(t) 
is strongly stable. By multiplying %{t) for both sides of the 
inequality (t + 1) > ql (i) — r (t) — e + P* (t) and rearranging 
terms, we obtain q b (t) [P* (t) - r(t)] < q b (t) \<%(t + 1) - 



. By summing from 0 to T — 1, dividing by T and 



taking limsupy^^, we have 

T— 1 

y lim sup i E 96 W (*) - r (*)] 

1 1 T 

<— lim sup - E 96 W !) - 9b(*))- 



T^oo 



llimsupM^Mn^MM 



. ,. 1 9bW 

+ limsup - > —— e 



1 1 V 

< vy(Vo + r max ) limsup - E«t (*) + 17(^7 + A max) 

max \ 

since the average queue length of the auxiliary queue remains 
finite no matter how large the system parameter B b becomes 
and is not related to the algorithmic parameters V. By letting 
e — > 0, we finish the proof. ■ 

Note that q* d (t + 1) = (q* d (t) - M (P*(t)))+ + R*(t) > 
q*d( t )~K p *( t )) + R*( t )- B y multiplying both sides with q d {t) 
and rearranging terms, we obtain qd(t)(R* (t) — n(P*(t))) < 
qd{t){q* d {t+l)-q d (t)). With the optimal policy, q* d is strongly 
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stable. By summing from 0 to T— 1, dividing by T and taking 

limsupy^^, we have 

1 T_1 

U ^P f E W - (*))] 

l T 

limsup - V q* d {t){q d {t - 1) - g d (i)) 



iii) if l 0 (t) = 1, then <ft(t + 1) = min[r(f), £?;,] by definition 
of discharging event Equation (10). If M(t) > 0, then this 
means r(t) > B b and -r(t) + M(t) = -B b ; 

iv) if q b (t) = 0, then q b (t)M(t) = 0. 

Combine the above discussion with Equation (16), we have 
that Vt > 0, if M(t) > 0 and q b (t) > 0, we must have 
ibit) < (3(^ + A max ) + max[r max , 1] - B b . Thus, 

T-l 



<Mmax lim sup -^>q* d (t) , 



(36) 



which is a finite constant that is not related to V. 

By summing from 0 to T— 1, dividing by T and V, taking 
limsupy^^ over Equation (35), combined with Lemma 3, 
and Equation (36), we get 



T-l 



T-l 



liminf i V J2(t) > liminf I V R*(t) - ^(/w + /?) 



t=o 



T^oo T 

-O(i). 
II) B d = oo and B b < oo. 

From Equation (12), we have q b (t + 1) < ~ ?7o) 2 + 

(J„(t) +P(t) -r(t) + M(i)) 2 + 2(&(t) - Vo ) + (l 0 (t) + P(t) - 
r(t) + M{t)), then 

A=A(q d (t),q b (t)) 

<(1 + Ppeak) 2 + (Vo + r max ) 2 + V 2 {P{t)) + R 2 {t) + W2(t) 

+ 2[q d (t) - V/2]R(t) -2[q d (tMP(t)) - q b (t)P(t)] 
+ 2q b (t)(l 0 (t)-r(t)+M(t)) 

<(1 + -fpeak) H" (^?o H" ^max ) 2 + Mmax + + 2q~b(t)M(t) 

+ V(R{t) - R*(t)) + 2q d {t) [R*(t) - /i(P*(t))] + 2&(t) 

[P*(t) - r(t) + M*(t)] + (V + 2A max )(/3 + Mmax)7o(t) 

Lemma 4: 



i limsup i^>(t)M(t) < 0( ( ^ + 
T ~ > °° t=0 



)• 



^ limsup JVft(t)M(() 



t=o 



< 



+ A max ) + max[r max , 1] - P 6 )+r„ 



The remaining argument is similar to case I), and we obtain 

^ T-l ^ T-l 

1 — >oo / •* — ' 1 — *oo / ' — ' 



t=0 



t=0 



_ 0( ^_o ( (i^,. 

Ill) B d < oo and B b = oo. 

From Equation (13), we have q£(t + 1) < (g'b(t) — ?7 0 ) 2 + 
(7 0 (t)+P(t)-r(t)) 2 +2(q b (t)-?7 0 )+(7 0 (t)+P(t)-r(t)).Also 
from Equation (11), we have qj(t + 1) < (g d (i) - ri d R{t)) 2 + 
(I(t) + R(t) - ^P(t))) 2 + 2(&(t) - /7 d i?(t))+(/(t) + fl(t) - 
H(P(t))), then 

A=A(« d (t),g t (t)) 

<(1 + Ppeak) 2 + (Vo + W) 2 + M 2 (^(^)) + (1 + V 2 d )R 2 (t) 

+ VR(t) + 2q b (t)(l 0 (t) - r(t)) + 2g d (t)J(t) + 2 



(l - Vd)qd(t) 



R(t) - 2 q d (t)t,{P(t)) - q b (t)P(t) 



I 2 

max 



Proof: We first provide a rough idea of the proof: by exploring 
the relations between q b (t) and q b (t), we notice that as q b (t) 
increases from 0 to B b , q b (t) will decrease below (f (V + 
^4max) — B b ) + at some slot. We now give the proof details. 

Without loss of generality, let q b (0) = 0. We have the 
following cases: 

i) if P(t) > r(t), l a (t) = 0 and q b (t) > 0, then M(t) = 0, 
q b (t + 1) < q b (t) and q b (t + 1) - q b (t) < q b (t) - q b (t + 1), 
i.e., even if q b (t) increases, the increment is no larger than the 
decrement of q b (t); 

ii) if P(t) < r(t), l a (t) = 0, then if q b (t + 1) > 0, , , , 
q b (t) - q b (t + 1) = r{t) - P{t) - M{t) - t] 0 + (t] 0 - li mS up- V q d (t)I(t) 
q b (t)) + > q b (t + 1) - q b (t) = r(t) - P(t) - M(t), i.e., the ' ' ^ 
decrement of q b (t) is no less than the increment of q b (t), else 
if q b (t + 1) = 0, it goes to case iv); 



<(1 + Ppeak) 2 + {Vo + r ma x) 2 + Mmax + (1 + Vd)^ 

+ 2q d {t)I{t) + (V + 2A m . dX )(f3 + Mmax)io(i) + VR(t) 
- VR*(t) + 2q d (t) [R*(t) - M (P*(i)) + P(t) - D*(t)] 
+ 2q d (t)[D*(t) - Vd R*(t)] + 2q b (t)[P*(t) - r(t)] 

Without loss of generality, let q d (0) = B d . Similar to the 
proof of Lemma 4, we have the following cases: 

i) if D{t) > 0, then from q d (t + 1) = (q d (t) - n(P(t)))+ + 
R(t) - D(t) and I(t) = (fi(P(t)) - q d (t))+, we know that 
R(t) > B d when I(t) is strictly positive. Further, whenever 
D(t) >0, q d (t+l) = B d ; 

ii) if D(t) = 0, from q d (t + 1) = q d (t) - n(P(t)) + J(t) + 
R(t) - D{t) and q d (t + 1) = ((q d (t) - T] d R{t))+ + D{t) - 
li{P{t)) + I(t) + R(t) - D(t))+, q d (t) decreases no slower 
and increases no faster than q d (t) until q d (t) hits zero. 

Note that only when q d (t) < ^i max , I(t) may be strictly 
positive. Then combine with the above discussion, we have 



T^oo 



T-l 



t=0 



< 



( 2 ^-max + A*max Bd)^ 



V 



B d 



V 
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Similar to Lemma 3, we can obtain 

T-l 



limsup i J2 Ut)[D*{t) - < 0(1). 



V 



t=o 



The remaining argument is similar to case I), and we obtain 

j T-l j T-l 

liminf - R(!) > h ; minf ™ E fi * W - ^(^ax + 0) 



t=0 



t=0 



0(^)-0( 



^ 2 



B d ) + 



V 



)• 



IV) Bd < co and Bf, < oo. 

Simply by combing case II) and case III), we obtain 

^ T-l ^ T-l 

liminf - V R(t) > liminf - V R*(t) - r/ 0 (u max + (3) 



t=o 



t=0 



0(^)-0( 



( z 



V 



■)• 



C. Proof of Corollary 1 

Similar to the proof of Proposition 1, we have the fact 
if q£(t) is strongly stable, then limsupj^^ = 0. 

From Equation (27), we have qg(t + 1) > (i) — «™ + 
E; e n„ u e„ ^(*) - »"„(*) + M n (t) + l n 0 {t). Note that # (t + 
1) = 9? W-Ei 6 n B ue„ fl(*)+r„(t)-M n (t). By summing 
from 0 to T — 1, dividing by T and taking limsup of both 
sides, we have 

limsup > lim _ n ™ + l imsup I J2 l n 0 (t) 



+ lim 

T-»oo 



Since limsupy^^ = 0, lim-r^ ^1 = 0, 

and liniT^oo ^M^g) = 0 , so we get = 
lim sup^ i E^o 1 7 " (*) < C , Vn e N. ■ 
D. Proof of Theorem 2 

Proof of Equation (29) and Equation (32): We prove Equa- 
tion (29) by induction. Let <7™ ax (i) be the maximum data queue 
length for all flows at slot t. Assume that g™ x (£) < \ + A mdx 
(holds for t = 0 by letting <^' e (0) = 0, Vn, e e N), need 
to show that it holds at slot t + 1. Consider the data queue 
q d ' e (t + 1) maintained at any node n for flow destined to any 
node e^nat slot t + 1. If node n received data destined to e 
from other node m at slot t, then by the routing policy in Sec- 
tion V and definition of w (m ,„)(i), q d n ' e (t) - q™' e (t) > l e (m n) , 
where (to, n) is the link from node to to node n. Choose 7 
such that the resulting backlog of the receiving node is not 
longer than that of the transmitting node (let n™. dx to be the 
maximum endogenous arrivals, then 7 = uJJJ^ + A m . dx satisfy 
this condition), we then have q^' e {t + 1) < q^ e {t) + 7, then 

q n /(t + 1) < qT e (t) < lT X (t) < \ + ^max- If node n 
did not receive any data destined to e from other nodes, then 



it can only have exogenous arrivals. Clearly q d ' e (t + 1) < 
q2' e (t) < \ + A mdx if there were no exogenous arrivals. 
If there were exogenous arrivals, by MRC of Section V, we 
must have q d ' e (t) < \, then q d ' e (t + 1) < \ + A mdx . Thus, 
Equation (29) holds. Equation (32) can be shown using the 
same argument. 

Proof of Equation (30) and Equation (33): Consider any 

link I*. Since w-(IMt)) < W* (°) + $ P l* (*) for e 
II/- (t), yt > 0, we have 

Wl .(t)MPt*(t)) (C (r) (*) + C c(r) )^W 

<«;,.(t) W .(0) + /9«;,.(t)fl.(t) - (C^W +C C(r) )^*W. 



where P;» (f) is the solution of Equation (28). If 

/?«;,. (t)Pi.(t) - (C (r) W + C (r) )^*W < 0' 
then^ we get «,,.(i) W .(fl.(t)) - (C^*^) + 
~r ec C <; w/» (0). However, solution 

of Equation (28) chooses P;» (i) that maximizes 



(qT n(r '(t) 



~rec(/*) 

9 6 



which 



means «,,. (t) w . (P,, (t)) - (^'^(t) + ^''^.(t) > 
wi*(t)/j,i*(0) since 0 G II;. (t). Then we must have 



(C n(H (*) + C (r) )^W < 0wi*(t)Pi*(t). (37) 



We now prove Equation (30) by induction. Without loss of 
generality, let q%(0) < (3(^ + A miix ), Vn e K. Suppose for 
all f > 1, q£(t - 1) < (3(^ + A mdx ), VneN holds. In slot t, 
if all the incident links of any node n is assigned zero power, 
then qg(t) < (% {t-l) < /3(f + A m . dx ). Otherwise, let I* be an 
incident link assigned non-zero power, and by Equation (37), 
we have g£(t) < /?«;,.(«) < /?C x (i) < + A max ), Vn e 
[NT. Equation (33) can be shown using the same argument. 

Proof of Equation (31): Define L(q d (t),q b (t)) = 
E„, e (?rW) 2 + E„ («T(*)) 2 . then 



A(t) =i(^(t + l),%(t + l)) -i(gi(t),«6(t)) 

< J] + ^max) 2 + "max] + ]T [(1 + Ppeak) 2 + 

n, e n 

+ r max ) 2 ] + 2 2 qT(t)K(t) + 2 ^ ^ (*) 

n,e n 

7«( i )-r„(t) + M„(t)]-2^[(C n(i) ' e!(t) W- 

g ~(0.e.(t) (t))w(pi(t)) _ ( -«a„(0 W+ -ec(0 w)p;w 



Note that if ^an(0,e !( t) w _ ^ 



rec(0,e((t)^-j ^ ^,e ; (t) 

0 and fJ.i(Pi(t)) = 0, then 



then wi (t) 

( q T {l)Mt \t) - 9r (,),e,(t) w)w(^(*)j' = wiWmm)); 

otherwise, g^O.eiW ^ _ ffWMQ^ > then 



/ IT 



tran(i),ei(t) 



(t) - gr (0,e,(t) (*))w(^W) > ^(*)W(^W)- 
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We then have 

max + A*max) + A*max 

n,e n 

{€ + W) 2 ] + 2^ qS(t) [r Q {t) - r n (t) + M n (t)] ~ 

n 

n,e 

Note that E^NWwW)) - (C^W + 

- E leM(f )h(t)wW(t)) - («r (0 (*) + 
C c(0 (W(*)] > E; eM . w NWw(^W)-(C (i) W + 

qf^) where M(t) is the matching chosen by 

MWM algorithm and M*(t) is the matching picked by 
optimal policy in slot t, and Pf(t), V7 e £ are the 
suppositionally calculated power in MPA. Since the objective 
function of power allocation component is separable over 
links and for each link it is concave, using similar argument 
as in Lemma 1, we have 

max + A*max) + A*max ] + £ [(! + -Ppeak) 2 + 

n,e n 

(Vo + W) 2 + (V + 2^ max )(/3 + ^)l n 0 {t)] + 



vE^) + 2 £ 



n.e /,\ 

i d (*) 



v 

2~ 



£ (C (0ie (i) - i7 {lu (t) - iM(pnm + 2 £ 

#(*) £ *T(*)-»-n(t) + M*(t)+M n (t) 
zen„u e„ 

max + 7 + Mmax)^ max 

] +E t( y+ 

n,e n 
2A max )(/3 + Mmax)^(t) + (1 + Ppeak) 2 + « + T'max J 

+ ^E (kw - K*(t))+2j2%(t)M n (t)+ 



2EC ,e (*) 

n.e 

+ 2 £&"(*) 



^*(*)+ £-£ koto) 



E Pt(t)-r n (t) + K(t) 



(38) 



Similar to Lemma 3, we have 

T-l 



iiimsupiEE^w E ™ 

T ^°° t=o n L ieo„|je„ 

-r„(t) + M*(i) 



Similar to Lemma 4, we have 



T-l 



i7 h T ^ f ^ £ £ ffWMnW < E °( 



t=0 n 



V 



Note that the data queue dynamics can be written as 

?n* +i) =# e (*) E mfw))- E 

^(P i (*))-^' e (*)+^ut e W, 

where -D"' e (i) is the amount of overestimated endogenous 
arrivals to the queue q^ e since Eiee„ A*f (-^H*)) mav be 
larger than the actual endogenous arrivals; D"^(t) is the 
amount of overestimated departures since Ej£n n A*f (-^H*)) 
may be larger than the actual departures. Further, at slot t, 
if node n does not have endogenous arrivals for flow e, then 
D™ n ' e (t) = 0; if flow e is transferred from node n to node m, 
then D^(t) = D™ e (t). By choosing 7 = A max + ,u m;lx , the 
resulting backlog of the receiving node is no longer than that 
of the transmitting node. We then have 

£9n*x#~(*+i)-# e *(')) 



- AT W + + E /WW) 



E aww) 



>£?r w 



^*(*)+E^w(*))-E^( p *(*)) 

(ee„ zefi„ 



Since ^ ,e (i) - q£' e (t + l) < Mmax, Vt > 0, Vn,eeN and 
q2' e * are strongly stable Vn, e G K, then 



T-l 



< 



iim S up 1 ^ J2irm{c"(t+i) - #-(<)) 



EL" 



t=0 n,e 

fi,e/m\ n.e 



lim 

00 



3r(T) ^e* (r) _ 9r(0)9 n,e* (0) 



+ 



^ max limsup-E'?^ e *( i ) 



T^oo 



t=l 
T 



< E lim sup ^ £ c e * (*) ' 



(39) 



T^oo 



t=l 



which is a finite constant that is not related to V. Thus, 

we have £ limsup^ ± E^ £„ >e WK* (*) + 

£ Je e n /W(*)) - £ Ietln /W(*))] < 0(f). By 
summing from 0 to T — 1, dividing by T and V, taking 
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liminfV^oo for Equation (38), we get 



t=0 
T-l 



> 



£ Hminf ^ E E ^ (*) ~ ^» 



/3) 



t=o 



o( v ). 



which is Equation (31). 

Proof of Equation (34): Define L{q d {t),qb{t)) = 

iE„, e (9r(*)) 2 +iE„fe"(*)) 2 . 

By MRC, we have for any n, e E K, n 7^ e that 

[?rw-^nW = te' e (*)-^]^(i) < K ,e w- 

< M^W - V]R?{t) if g^W < V; and 

[#"(*) - = 0 < \ [# e (*) - V]K*(t) if #"(*) > 

V. Thus, 

# e (*) y] K(t) < \ [ q n d e (t) - v] K{t). 

From the efficiency of the distributed matching algorithm 
[18] [19], we have EietNWwW)) - (%* \ t ) + 

C {l \t))Pf(t)] >lE l eM*(t ) lMt)MP l s (t))-(qT nil) (t) + 
q^ityPfit)], where M(t) is the matching chosen by MWM 
algorithm and M*(t) is the matching picked by optimal policy 
in slot t, and Pf(t), VZ <G £ are the suppositionally calculated 
power in MP A. 
We then have 



A(t) =L(&(t + 1), &(t + 1)) - L(q d (t),q b (t)) 



< 



Ell 



max max + 



r max ) 



n 

*E#'(*) 



isfi„lje„ 




The remaining argument is similar to the proof of Equa- 
tion (31). ■ 



